Discovering Associations in XML Data
Abstract
Knowledge inference from semi-structured data can exploit frequent substructures in addition to the frequency of data items. The working assumption of the present study is that frequent sub-trees of XML data represent sets of tags (objects) that are meaningfully associated. A method for extracting frequent sub-trees from XML data is presented. It uses thresholds on the frequencies of paths and on the multiplicity of paths in the data. The frequent sub-trees are extracted and counted in a procedure that has...
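The abstract does not spell out the extraction procedure, so the following is only a minimal sketch of the path-counting idea it mentions, not the authors' method. It assumes that "frequency" means the number of documents containing a root-to-node tag path and that "multiplicity" means how often that path repeats inside one document. The names `tag_paths`, `frequent_paths`, and `min_support` are hypothetical, and the step that assembles frequent paths into sub-trees is not shown.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def tag_paths(root):
    """Return a Counter of root-to-node tag paths in one document."""
    counts = Counter()
    stack = [(root, (root.tag,))]
    while stack:
        node, path = stack.pop()
        counts[path] += 1
        for child in node:
            stack.append((child, path + (child.tag,)))
    return counts


def frequent_paths(xml_docs, min_support):
    """Keep paths that occur in at least `min_support` documents.

    Assumption: support = number of documents containing the path.
    Each kept path maps to (support, max multiplicity in any one document).
    """
    support = Counter()
    max_mult = Counter()
    for text in xml_docs:
        counts = tag_paths(ET.fromstring(text))
        for path, n in counts.items():
            support[path] += 1
            max_mult[path] = max(max_mult[path], n)
    return {p: (support[p], max_mult[p])
            for p in support if support[p] >= min_support}


# Hypothetical toy documents, for illustration only.
docs = [
    "<book><title/><author/><author/></book>",
    "<book><title/><author/></book>",
    "<book><title/><price/></book>",
]
result = frequent_paths(docs, min_support=2)
```

With these toy documents, the path `book/price` appears in only one document and is dropped, while `book/author` is kept with a support of 2 and a maximum multiplicity of 2.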
Similar resources
Discovering Entity Correlations between Data Schema via Structural Analysis
At the forefront of data interoperability is the issue of semantic translation; that is, interpretation of the elements, attributes, and values contained in data. Systems which do not adhere to pre-defined semantics in their data representations need to dynamically mediate communication between each other, and an essential part of this mediation is structural analysis of data representations in...
Structuring Domain-Specific Text Archives by Deriving a Probabilistic XML DTD
Domain-specific documents often share an inherent, though undocumented structure. This structure should be made explicit to facilitate efficient, structure-based search in archives as well as information integration. Inferring a semantically structured XML DTD for an archive and subsequently transforming its texts into XML documents is a promising method to reach these objectives. Based on the ...
Extraction of Semantic XML DTDs from Texts Using Data Mining Techniques
Although composed of unstructured texts, documents contained in textual archives such as public announcements, patient records and annual reports to shareholders often share an inherent though undocumented structure. In order to facilitate efficient, structure-based search in archives and to enable information integration of text collections with related data sources, this inherent structure sh...
Apply Uncertainty in Document-Oriented Database (MongoDB) Using F-XML
As we move to a big data world where unstructured data grows at high velocity, a data store is needed to hold this large volume of data. Relational databases, the traditional choice, are not well suited to handling data at this scale, so a move to non-relational data stores is needed. In the current study, we have proposed an extension of the Mongo...
Automated Negotiation from Declarative Contract Descriptions
At the forefront of interoperability using XML in an Internet environment is the issue of semantic translation; that is, the ability to properly interpret the elements, attributes, and values contained in an XML file. In many cases, specific domains have standardized the way data are represented in XML. When this does not occur, some type of mediation is required to interpret XML formatted data...